Distributed Parallel Inference on Large Factor Graphs

نویسندگان

  • Joseph Gonzalez
  • Yucheng Low
  • Carlos Guestrin
  • David R. O'Hallaron
چکیده

As computer clusters become more common and the size of the problems encountered in the field of AI grows, there is an increasing demand for efficient parallel inference algorithms. We consider the problem of parallel inference on large factor graphs in the distributed memory setting of computer clusters. We develop a new efficient parallel inference algorithm, DBRSplash, which incorporates over-segmented graph partitioning, belief residual scheduling, and uniform work Splash operations. We empirically evaluate the DBRSplash algorithm on a 120 processor cluster and demonstrate linear to super-linear performance gains on large factor graph models.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Distributed Gaussian Conditional Random Fields Based Regression for Large Evolving Graphs

Dynamics of many real-world systems are naturally modeled by structured regression of representationally powerful Gaussian conditional random fields (GCRF) on evolving graphs. However, applications are limited to small and sparse evolving graphs due to high computational cost of the GCRF learning and inference. In this study, a new method is proposed to allow applying a GCRF model to large and ...

متن کامل

Distributed MAP Inference for Undirected Graphical Models

Graphical models have widespread uses in information extraction and natural language processing. Recent improvements in approximate inference techniques [1, 2, 3, 4] have allowed exploration of dense models over a large number of variables. These applications include coreference resolution [5, 6], relation extraction [7], and joint inference [8, 9, 10]. But as the graphs grow to web scale, even...

متن کامل

Fast parallel conversion of edge list to adjacency list for large-scale graphs

In the era of bigdata, we are deluged with massive graph data emerged from numerous social and scientific applications. In most cases, graph data are generated as lists of edges (edge list), where an edge denotes a link between a pair of entities. However, most of the graph algorithms work efficiently when information of the adjacent nodes (adjacency list) for each node are readily available. A...

متن کامل

Inference of large phylogenetic trees on parallel architectures

Due to high computational demands, the inference of large phylogenetic trees from molecular sequence data requires the use of HPC systems in order to obtain the necessary computational power and memory. The continuous explosive accumulation of molecular data, which is driven by the development of cost-effective sequencing techniques, amplifies this requirement additionally. Furthermore, a conti...

متن کامل

Factor Graph Inference Engine on the SpiNNaker Neural Computing System

This paper presents a novel method for implementing Factor Graphs in a SpiNNaker neural computing system. The SpiNNaker system provides resources for fine-grained parallelism, designed for implementing a distributed computing system. We present a framework which utilizes available SpiNNaker resources to implement a discrete Factor Graph: a powerful graphical model for probabilistic inference. O...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009